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rne writer of a raaaonably long piece of expository 
^ prose usually discusses a lauffife^r of topics In praeentlng the 

general subject of the passage. Part of the reader^s und©r» . 
r*^ standing of what the writer Is trying to say InTolTes pereelTlng 

Q these topics. At presentt little la known about the stimulus 

attributes of texts that readers use In detecting the toploa 
presented by the writer. The purposes of the present study were 
(a) to develop a method for describing the topical organisation 
perceived by readers and (b) to look for text features that ' 
detarnjlne the reader's perception of topical orgamlEatlon. 



Method of Describing Topical Organl^mtlon 

TSie method for describing topical organisation Is 
based on readers • Judgments of the topical relatedness of 
seiitenceaf rather than on a content analysla of the meaning 
of sentences. Headers mapped topics onto text sentences by 
Identifying sentences that were topically similar. Then an 
hierarchical clustering analysis of the matrix of reader responses 
was used to determine clusters of topically identical sentences, 

to ; > ^ 

((] Thm details of the method are as follows, 

% 1, Mapping of Topics onto Sentences, 

V5 A. Reading, A reader was given a passage and told to read 

. ^ 2 
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the pftBSise earefully, Thm reader was told that he would be 
«sk©d to sumBarlz® the topics dlaoussad In the pMsage after he 
had finished reading It* 

B, Topic genemt Ion, After reading the pas sage once, the 
reader was told to reTlew the pasBag© and write down froii five 
to nlno toples* A topic was defined for the reader mm a ganeral 
desorlptlon of something the writer dlseussed. The roader was 
also given an example of a topic. To encourage the reader to 

describe topics elekrly, he was told that another person would 

■ \' 

try to locate the topics In the passage. Wie reMer wrote down 
his topics on separate 3x5 oaWs which were collected by tttie 
experlttenter* The reader could consult the passage while 
generating topics, 

C. Topic Identification, The reader identified the sentences 
that discussed each of the topics he generated, K»e reader was 
given his own topic cards, one at a time, along with the 
passage and was told to underline all the senfeenoea that had to 
dp with that topic. The reader's topi© cards were given to him 

in a haphazard order obtained by shuffling the oarda. Each 
tine the reader was given a card he was also given a fresh eopy 
©f the pasflsge, Bils allowed the reader to assign a sentence 
to nore than one topic, 

g, ]^ta aftalyslflt 

The readers* reflponBes were entered into a matrix 
raprefltntihg all possiblt asatenee pftlrs in a passage, A cell 
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entry In this reapens© matrix la the n}mber of readers who 
tmderllned both sentenees of a pair to Indicate that they 
belongecl to the same topic . The following two maasures of • 
topleal similarity were then derived from this reBponse matrix « 

(1) Fr«quenoy measwre ..of similarity. IhlB measure la 
the numter of readers who Indloated that two sentenceB ware 
toploally related, mia measure of similarity la s Imply the 
cell entry In the response matrix for a aentenoe pair. 

(2) Pro^blllty meaattre of slmllaplty. This measure la the 
cell entry In the responae matrix walghted by the number of 
readers who'aiaJened the two ^Bentonoea to topics. TSila measure 

18 the geonetrlo mean COM) of two oontlngent prol»bllltlea i 

GM - P{[.W|m3/si) • PCLn,^/n) 

where PCCn.^Q/-^) protoaMllty that sentences n and 

^ '^''"-m ar® asfllghea to the mame topic given that 

sflsxtence m jla aBfflgned to a topic i 

P([n»m]/n| Is the probability that n and m are 

esslgJMd to the atmrn topic given that 
aentonee m la assigned to a topic i 
ljn,m^ 18 the number of readers asslgnlns n 

and m to the aam# topic i 
n Is th« number of readera assigning 

ssntenca n to topicBi 
m la the number of roaders assigning 

sontenoe m to topics ■ 

-Ihffi two slmllarlfcf natrlosa deplved from a response matrix 
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aiffar from aaeli ©ther in that the level of similarity batwseii 
two aentariots for tha f rsquenoy meafiure Is partly detaralnad 
by the number of aubjaots who aaslgwed the aentenoee to fcopies. 



TtiiB la' not the oaee for the probahlllty measure* 

/ ^ ' . 

3* Hlersrchlaal CliAsterlng Analyelt* 

An hiei^rchlDal olusterlng anaiyels was used to determine 
the topleal organisation of a passage* mils analysis finds the 
sentences that are cloaast together at eaoh level of topleal 
similarity In a slfflllarlty aatrl^. The analysis results In 
a tree struetura. At the top of the. tree are olustera of sentanoea 
that are hlshly similar to eaoh other. Clustara 'at lower levels 
of the tree represent decreasing levels of similarity between 
sentences* Thm tree struoture Is hlerarehloal. Sentenees that 
form a oluster towards the top of the tree are all membars of 
the same cluster at lower levels of the trae^ Aleot a sentence 
cannot be a member of more than one olustar* 

Often data that ^re being analyEed are not truly 
hlerarehloally organised. Johnson U967) has developed rilsorlthms 
for retrieving unclerlylng hlerarehloal structures* Johnson -s 
ma^ilmum aethod was chosen for the present analysis of topical 
organli^atlon* Thm maxlffium method Is an Iterative procedure 
for joining clusters together until only one cluster remains s 
©lis method only forma a new cluster when mil the similarities 
between cluster sentences are at or ateve a minimum value* 
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This mlnlmim Talus defines a cluster IbvbI of a tree. ^^^All cl\^±ers 
formlns at a eluster level have the same ffilnlmum. slallarlty value. 

Any similarity matrix can be analyzed with the maslmum 
methods TOierefore, a scheme Is needed for evaluating the goodness 
of-flt of the: tree structure to the data being analy^eds 
Halff {1975) has suggested a graphical evaluation technique # 
Halff propos^?^. that an Index of goodnass-of-flt be ealoulated 
for each cluster level of a tree. This index Is the pi^oportlon 
of. unclustered sentence pairings, that "Should have been, clustered 
at a given cluster lavsl. This Indez would be zero at all 
cluster levels If the data were truly hierarchically structured, 
-Therefore, goodness^of -f It can be evaluated by looking at the 
deviation of the Index from zero at each oluster level* 
Halff also proposed that a value of the Indes can be eslculated 
that represents tho worst fit for each cluster level, ThiB is 
the expected valua of the index If sentences were clustered 
together at random. 

Halff 's evaluation procedure was used In the present 
study to decide how well the readers* perceptions of topical 
^elatedness were described by the hierarchical clustering 
analysis. Using this procedure, It was also possible to decide 
which of the two measurda of similarity produced the better 
tree structure for a passage. 

6 • 



Applloatlon of the Clusfcerlns Taehnlque 

The oluatt \ng taohnlquo was used to detamlna the 
the topical organization of two prose passages • Two tree 
;sfcructur#a were derived for each passas^i Qne was based on the 
Ifrequency measure of similarity and the ©ther, on the probability 
tmeasiire of similarity. 

Method . " . ' 

Materials Tsfo passages were taken from a publication 
containing teohnlcal articles of general Interest. One 
paasage (GOP) discussed gopher dMage to telephone cables j the 
other passage (NOD) discussed pay telephone service. Each 
passage was approximately 2000 words long. Biere were 97 
sentences In GOP divided into 24 paragraphs i there were 99 
sentences In NOD also divided Into 24 paragraphs,- Both 
passages were fairly difficult ( Flesch Reading lase Score 
was 50.7 and 50.8 for GOP and NOD, respectively). 

Subjects. Jud^ents of toplbal re latedness were made 
by two groups of readers , 

(1) Twenty-eight paid, volunteer high school students. 
Each student only saw one of the passages, 

(2) Thirty-two paid, volunteer college undergraduates, 
^ese readers also served In Experlnent 1 , to be reported. 
They; read the publlshad version of one passage and an altered 
version of ttim other passage. Only Judpnents for the published 



versions of a passage were used* Sixteen studente ganeratad 
toplos for GOP and 16 generated topics for NOD. " - 

He suits 

The response matrices produced by the two groups of 
readers ,were compared with each other* The Pearson product- 
moment 'correlation between the matrloes Indicates that the 
two groupo of readers were In general agreement about the 
topical relatednesr of the sentences (r^ •82 and .93 for GOP 
and NODp respectlirely ) • This result suggeste that the method 
for desorlblng topical relatedness yields reliable results* 

The response matrices of the two groups were combined 
Into a single matrix for each passage, "^ese combined matrices 
were used to derive the frequency and probability measures of 
Inter-sentence similarity/ Each similarity matrlaE was then 
subject to an hierarchical clueterlng aimlysls* 

The tree structures representing the topical organization 
of a passage are shoHn in Pig. 1-^. PigUTes 1 and 3 are the , 
tree structures for GOP and NOD derived from the probability 
measures of similarity. Figures 2 and 4 are the tree struetures 

Inser| Fig . _1 , _|^_2_and_4_about_here , 

for GOP and NOD derived from the freq.uenoy measure of; similarity. 
The oluster- levels and their similarity values are shown In 
the 4ef.t-hand coliiffln of each figure. Sentence numbers are 
listed a^^ of the figure. The foMilLtlon of a new ' 
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Mmntmnom eliister from an old eluster is indicated ty a horizontal 
dashed 11ns oonnectlne the new santenees entering a cluster 
with the Tertlcal line coming down from the old cluster. 

In order to damonBtrate how theae tree structures relate 
to the content of a passage , Tables 1 and 2 g Its verbal descriptions 
of *he content of the clusters for GOP and NODt respectlTely . 

Insert '^bles l_and^2_about^here# 

A brief summary of each sentence In a passage la shown, ©le 
sentences are organised as they are In the tree structures, 
derived from the pro'teblllty measures (Flgi 1 and J). Sentence 
nuffibars are shown In the left margin of the page. The lines 
connecting sentences correspond to clusterings at dlffarent 
levels of the tree. The letter labels correspond to the letteirs 
labeling each major branch of the trees In Pig. 1 and 3. The 
sentenbe numbers and brackets In the right marglne of the two ... 
tables represent the tree structuras derived from the frequency 
measures of -slmliarlty (Fig. 2, and 4). ^Thls Information was 
inoluded In the tables to help In comparing the two tree 
structures for a single passage. 

The two dlfferant measures of similarity produced 
slightly different tree structures for each passage. Clusters 
differed primarily In the clustering levels at which specific 
sentences Joined clusters, rather than in which sentences formed 
clusters. This Is Illustrated by the letter labels on the 
figures* Each major branch of the probability-measure trees 
for GOP and NOD has been assigned a letteri Hiese letters were 



then uead to latsl Gorrefipondlng parts of the frequenoy ••measure 
trees* For GOPp four of the flTe major branches In Fig. 1 
correspond to major branches in Fig* 2 One Bajor branch (E) . 
in Pig* 1 apllt Into two branches In Pig, 2. For NODs four of 
the six major brancheg in Pig. 3 correspQnd to major branohea 
In Pig, ^» Parts of two branches (C and .D) in Plg» 3 torm a 
single branch In Pig. 4. 

ETaluation of Goodnass-of-f it . SHalff 's goodness«of-f it 
Indioes ware calculated for all trees and used to determine 
which measure of aloilarity produced th# better tree stricture 
foi a passage. The indices and the estimates of the worat 
fit are plotted in P^^.~5-8. These plots correspond to the 

tree structures in Pig. 1-^* Halff's index remains reasonably 
near zero for all tree structures except at low levels of similar* 
ity. This finding suggests that the topical organisation retrieved 
by the clustering technique describes the similarity data 
reasonably well, 

. Specific comparisons between similarity measures for 
both passage^ Indicate that the tree structure derived from a 
probability measure matrix provides a better description of the 
readers' perception of the topical organlMtlbn of a passage 
than the tree atructure derived from a^ frequency measure matrix. 
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For both OOP and NODt the daTlatlons of the goodnaas^ef-f It 
_liidio©6 from Earo apptar to be lasa for the pro^blltty maasur© 
data (Flg« 5 and 7) th^ for the fraquanby meaaura data ^ v 
(Pie. 6 and 8). . ^ . 

Discussion - 

Kne hlerarohlcal olueterlng aimlysle ganarated olustaM 
of seffismtieally ralated sentenDas. Thasa oltwtaM appeared to 

! . ^ V ' . ■ . 

be ralatad to thaisubjaots dlsDUseed In the passasas. In addition . 
the cluatarlng analyels producad traa atruetures that deeorlbad 
the almllarlty data for santanoas raaaonably well. These find- 
ings Indicate that tha clustering method Is a ¥alld method for 

i ■ _ 

describing tha readers* perception of toiplcal organization. ^/ . 

j ^ . 

Thus, clustars of topically similar sentences can be ratrlevad 

without evaluating the verbal labels that readers use to describe . 

topics* 

Exploration of Text Features that DateiWlne Topical Organization 

Experiment 1 

topical organizations for GOP and NOD (Fig* 1 and 
3) were compared with clusters of sentences that had been fomed 
from sentences that shared content words* Wils comparison was 
part of an effort to find text features that would p^redlct topical 
organization. ThB co-occurrence of content words in sentences 
was analyzed because It was thought that writers might tend to 
the same words when discussing a topic. If spp sentences 



that were similar In their use of content words should b© topically 
related. > 

j The comparison revealed little relationship to#tween 
clusters taeed on topical similarity and clusters ^sed^n 
lexical similarity. One reason for this lack of correspqnditnoe 
was that topically similar aeiitencas fended to te near each other 
In the text while lexloally similar sentences could be located 
anywhere In the text. It was hypothesized that spatial ^ro^lmity 

might have an Influence on the reader's perception of topical 

\- ^ ' . ~^ ■ ■ ^ . ' ^^'l ' ' ■ 

relatednass. In that casep lei^lcally similar aentenoes ; would 

be perceiTed as topically similar If these aentences were near 

each other. To test this hypothesis p the GOP and NOD passages 

were rewritten* A few lexically similar sentences tl»t were 

far apart in the original texts were moved so that they were - 

adjacent to ^iach other in the rewritten teite. Each pair was 

chosen so that the sentences of a pair were lexically highly 

\ similar and topically Tery dissimilar. It was predlct€>d that / 

the sentences would become topically similar In the rewritten 

^. " . ^ ^ . " ■ ■ \ ^ ^. ■ ■ 

passages. . - 

.Method 

Material. Both the original and rewritten versions 
of the GOP and NOD^exts were used. The texts were rewritten 

/. ^ ^ ^. . . . . 

SO that nine sentences were moved in each passage* A moTea 
sentence was placed next to the sintenca with which it was lex- 
ically similar p but topically dissimilar • Topical dissimilarity 
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was inaux^ed by pleklng sentenet pairs that cama together In a 
eluatfir at low levels of the tree reprBsentlng the topical 
prganlzation of a passage i Leacloal similarity was Insut^^ 
pioklng sentenees that olustei^d together at hll^ l^Tels of th©^^ 
tree representing the lexloal organiMtlon of a p III© v 

lexleal organization of a passage was determined from a measure - 
of lexteal similarity. Thtm measure was baset on both the ^ | 
numbey of eontent words shared by two aentenees and the number 
of content words In the two sentanees that OQ-^o^aurred In other ^ 
s©nten0es. Tb.a Index of oommonallty (^es©, 1965) was used to p 
detemlne the degree d£ lexical similarity* ' - * 

When a sentence waa moved , all lanaphoral words were 
;replaced ,wlth their referents. In ad^l ton, the tens© of the 
moved eentenee's varb was changed If It conflicted with the verbs 

In th© sentences adjacent to the moved\ sentence. 

~ " ■ ^ \ ■' -■" - - ^- ^ % n •■ ■ 

ftfooedure* The versions of a passage seen by a reader 
during th© topic geheratlon and topic idehtlflcatioh phases of* 
the mapping procedure were yarl©d. Some readers saw th© sane 
version of the text In both phases (Op « or Iglna^ version of 
t©xt during both phases t RE - rewritten version during both 
phases). Other reiAers saw ona^verslpn of the text while 



geneimtlng topics and the other v 
sentences (OR - original version 
versldri during Identification I RC 
geneimtlon and original version during Identification) . 

Thm design for the experiment Is shown In fcble 3. 
Insert Table 3 about here. ; 



ere Ion while Identifying topic 
during generation and rewritten 
• rewritten ^version during 



g^iJSC;^ ' Each ^reader mapped ttoples onto sentences for both passages 

WM::':. : \ ' " ■ - . - ' / . . .-, ■ 

®li/i>^. one passage t the readfer waa given a 

1^^ ; before taapplng the toplos for the other .passage; " 

: r ' Subjeots* /Sl%t7*four pald> Tol^ 

graduites Were/assigned at random to one of^the oondltlona ■ 

shown In lable 3. . / , 

Hesults - \ 

Rie mean similarity measures for the nine les^loally 
similar senteriees are shown In ^ble « ^ ^ deterailn© whether 

; . Insart^bla ^ about here. - 

"i- = ■ ■ = _ ^ - ■ ■ ^ - - = ■ ' ' ■- - 

these senten08S| beoame toploally similar when the sentences . 
were adjacent to each otherp the means- for condlttona ^0^ 
and RO were compared with the meaps for condition 00 # A 
' t-test for eorrelated means was usbdt 

/The lexically similar. sentencfs became 
when moTed naxt to each other during the topic Identification 

phase of the mapping procedure. For both passages, topical 

' ^ - ^ . , ^ ' ' ^. ■ ■ . iL^ ' " ' - " 

similarity was .slgnlf Icantay greater when the sentences were 

/adjacent (t f 8.91 arid 9 y76r df - 8,' p < .05 for the OR and RR 

: ^^d^ltl^ GOPi t - slsa and 5.84p df ^ 8, p < .65 for the 

OR and RR conditions of NOD). 

.-^^^^^ The topical similarity between pairs did not incrtasis 

when readers aaw the rewritten texts only during topic ^ 

generation (condition HO). For GOPi^^ sentences actually became. 

less topically similar (t ^ 2,96f df ^ 81 p-<.05) while for 



NOD, there virtually no change in topical similarity (t<l). 

Discussion /■ . ■ |. .\ , ■ ' ^ 

- The finding that use of the rewritten tfeit during 

topic generation had little effect on topical similarity le 
not eurprlsing. making the generation phase of the mapping 
procedure, readers were only concerned with sumisarl zing the , 
subjects discussed 'oy the writer, Apparently, the dlspls^^enrt^ 
of a very few sentences (less than 10^) did not disrupt the 
reader's ability to understand what the writer dlseussed, ^^^^^^^^ 

It was during topic Identlflca^tlon that sentence 
proximity had Its effect on topical similarity. During this , 
phase of the mapping procedure* readers have to malce Judgments 
about each sentence. In keeping with the i findings of Doollng 
and Laelman U971) among others, the results of this study show 
that the topical context of a sentence played an Important 
role in the reader's Interpretation of the sentence's meaning, 

• Experiment 2 • 

Whiles spatial proximity and the perception of topical 
relatedneBs are strongly related, this Is not tevarlably the, 
case. Detailed eiamlimtlon of the tree structures for both 
passages Indicated that the first sentence of ^ new paragraph 
and the last sentence of the previous paragraph were often In 
different clusters. This was especially so for clusters with 
high levels of similarity, These sentences at paragraph 
boundaries seemed to be perceived as topically less similar 
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to each other than setttencea within a paragr^ 

■ f 'Shim o^Br^mti:m that the perception of 

topical organization wa« influenced, by the typographical con- 
vent lori^ofraiTldini a pa Into paragraphs, Thli espsrlment 
evaluated the influence of paragraph structure on ^oploal 
organization by comparing passages with and without paragraph 
Indentation. Beadera ^appear to use paMgraph . Indaritatlon as 
one cue that a heif topic has begun. It was hypeitheslsed that 
sentences at paragraph boundaries (l.,e.», the first sentence of 
a- paragraph and the last sentence of t^e previous i>aragraph) -would 
be perbalTsd as aore topically similar whan the Indantatlon 
cue to toplo ahain$e was rsmoTrad* ; 

Method , ■ ' ' , - " ^ . .- 

Material. The original. Indentfd GOP and NOiD passages 
and versions of these passages without paragraph ihd'entatlon ^ 
were used. Each original .passage was divided int© p^ragr^phs . 
Thus, 2i*, of the 97 sentences- In GOP were indented ik the original 
versloh, and 2k of th^ 99 sentenoeb In NOD were Indehted In the 

original version. . \ 

\ Procedure, Readers mapped topics to ew/tences for 
both the GOP and NOD texta* Half of the readersysaw the GOt text 
first while half saw the NOD text first. After /napping topics 
for one passage, the reader was given a snort ^est before mapping 
the topics for the other passage. , ^ 

Headers used the Indented versions of the texts during , r 
Jbhe topic generation phase of the napping procedure. The unlndeXted 



Tension was used during tha topic Identification phasa. I^hls 
was don© beoauaa wa wanted readers to generic ta toplc^ uwdar 4 
optisal conditions for pereelvlnff topic boundarlei i In this 
way we could batter assess the effect of remoTlrig a cue to 
these boundaries on the perception of topical rilatednes^s* 
Also p the topical similarity data from the f Irst etoidy reported 
In this paper we» used to eTaluiite ch^geji In similarity ^d^ 
to tha ramoTal of indentation. Therefql^ep It ^ _ 
was dasir Able foT raaderg "in this experlmant to ienerata toplos 
under the same conditions as readers In the first study . 



Subjects » light ein paid , Toluntear ^%igh school 
students participated Ih tha axparlment. - 1/ ^ 



Results . r \ ;* 'ff 

Kie response mat3Pl% producad:^^ the teadars In :^e 



present eiperliant was compared wl^^ reeponae matrix for^ 



the 36 readers In the M^t study /^ ;t^ Pearson product-moment 
correlations indicated that'tha^e was janeral agreement between 
the two groups of readers (r ^ .90 and .80 for GOP and NODp 
respectively). Also, the tree structures resulting from a 
clustering analysis of the probability maasure matrices for this 
axperim.ent closed resembled the tree structures for the first 
study shown in Plg». 1 aftd 3* ^ ' 

The finding of little dlffe^^nce between the topical 
organization perceived by the two groups could be expected. 
Readers in both groups generated topics under Identical conditions 
and renoval of Indentation dwrlng topic identification only 



Affeotad 2^* santftmas. The gaiiaMLl sgraemsnt ©f the gTOiipg.- ; 
iiioTiflnks tha oonoiuslon that tha prakaht taeh^tqua ifor atudying^^^^^ 
topical organisation produced rfillmblt ra8ul|^^ 



;|-'..7?.. ■ - = ^. • i--.. , .• .. - «S . : -V . . , . , .,-"-V^r-5N^ 

Kia af f aot of raito^liiji pia;»graph IMan^tlo^ on tha ^ ' '^^^ 
I;. ©Iffillarlty of aanttnoes at iwragmph^ boundarlaa la Jhown Itt ^^^^ 3 

55&bla 5# This tabla shows the mean pwMWlJlty riatfsu . r ^^i 

Insert Tfcble 5 abpiit £er# w ^ -/^"^ 
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\^ Similarity (a) for the adjaoeiit sentaiv^aii at^pwmgrap^ 
(lia»9 the santaneas @Lt the and of on^ paj^g 



of the next) and (b) for tha ddjaoant gentariias ifil^ 
boundaries. ^ ^ ^ " 



When yeadarst utied the Indentad verblohs of > it 



adjapanf sentanoes at pW^jmph/^^jDH^^ 

ad Jaoant^antenoas: within paragraphs <t ^ ^ll,06i df ^ p4 sOS I 
for GOPi t ^ 12^10, df it 96, p< .05 for NOD)V When, paragraph ■ 
Indantatldn was S^amoTed 9 ■ tea^^rs . ifitlll pa3^dalyedj^h^^^ 
santaneas at paragraph btfu^arles:. to -be ;iasa ;f Imliari 
ad Jao ant santaneas within paragraphs (t «^»63t df * 94. p^ •05 :S| 
for GOPi t ^ 6,779 df i p < *05 for NOD), Howe wv^^ t^ 
was a tandaney for parftgtffiph*boujf^a:ry iantanoas^ be pereelYad^;:^^^^^^ 
as mora similar when^paimgraph Indentation Was ^ m 
whan Ihdantatlpn was ^ prasant , ThXm Inoraase; In; slmliarlty w^s 
statistically significant for both passages when aTaluatad 
with t-tasts for oorrelatad means (t = 2.58 wid 2,?8.for GOP 
and NODt df - 22, p < *05)* This Ineraasa In similarity for . 



the senteneeii at papagrsiph boundaries contrasts with a deoresse 
tn the a Inllarlty between adjaoent sentences within paragraphs. 



v Tn© rasults of this axpsrllaasitt ^ugsecit that pAragraph 



Inda^tatlo^ Is only on© cue that refers us© in>^et©rmlnlns 
-topleal tooundirlafl t STon when pawis»ph lnd©ntaM#^ 
ytad^ra atlll found ad-Jae^^rit S©nt©ifiC©s at paragraph >^undarl©s 
l©ss similar than adjaoent seht©^ wlttiln jiaimgraphft^ 
result 14 oonaistent With th© flndina of Ko©a ©t al (I969) 
that readers/ eah identify the first sentence of a ^a^g^aph 
with greater^than-Qhanee aoauracy when paragraph Indentation 



Jiaa beeri remoTed* The results of the present study suggest 
that paragraph indentation acts to ©nhane© 4he eff pots of t 
syntaotlo and semantlo cues to topical ^undaries. 



GeneWil Discussion^ . 
The study presented a method for determining the reader fs 
perception of the topical orgamiatlon of a passage. The 
hierarchleal elusterlng analysl^s produces clusters of seMhtlqally 
related sentences that seem to reflect the subjects discussed 
hy the writer* £n addltlont the method yields jreproduclble ^ 

results* . / / j \ ■ ; 

This clustfering method^of ret^leTl^ topical organisation 

waa deTlsfd to help in the search for text f^ readers 
use to underatand what they read* Vhm results' of Ixperlment l/ 



shoired that the laxieal almliai^ity of senteneeg may lead readers 
to peroQlTe that two santenoes are toploally similar If the 
sentanoes are olose together* ^Is f * tiding suggests th^ 
spatial relationship of sentenoes is an inportant datamlnant 
of tha peraeptlon of toploallty. Pjrhaps one of th^^^^^ / 
strategies used by readers is to assume! that spatially adjaoent ^ ' 
sentanoe'3 belong to the sue toplo mnlei^s the writer signals ^ ^ 
the reader that a new topic has begun • Experiment 2 prasantied 
aTldanqe that one suoh signal may be the typographloal o^^^ 
of paragraph Indentation. ^ i^: . 

Ihe hierarohloal elusterlng method pi^rXiLmm a reliable 
way 6f oharaoterlslng thii organisation peroelTed in a piffisaga • 
Having a metrlo to dasorlbe^" organisation oan be^ v^ ^to 
both writers and reiearohers. This olusteriAg^^ ft^^^ 
a way to map toploal organlMtlon on tezts and^^ t^^^^^ 

sequenplng of topics In a passage. Using this method^ It should ' 

be possible to eTaluatfi '^how alternatlTe ways of organizing 

affect readers from both a practical and thaoratlcal' point of ¥lewr 
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, VerBal dascrlption of the iGontent of the v ' 
sentences of the clusters for -She clusterljig' 
^ analysis ^of GOP. Sentences & the same cluster 
are bracketed together with stralgh€ lln^s. 
A dlamorid, ^ encloses ^the number of each v ^ 
santenca that Is tha first 'teantanoe of a 
paragraph. 



intFeduf tloa of gapher and hQW he de^tro^i eablas, 

fii^at, itudy:! t imi, place, gepheri^ - ' ./ A 

digression atiout ^firit itii^j dei^^liva i \ ; * 

pratactlah usad prlOT to -flrgt itu^i ©ther natert^is 
aiiD tfled in .stw^# !; 
pain conelualto of study 1, ' 
'tMagi have dhanged sia'oe first study, ^ 

niw ayaluati©n planned far I96S with ebjabtlvas 
deieribad, - . 
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14,15 



17 new avaiuatlon, 'who^whare, when, what. 



19, Interesting facts emergeA from intense study of oeqi^ 



desGriptioii ©f gephaf; lengthj weighty habitat ^ and sisa ^0,21 
of appatlta, . ^ . ' y ^_ . 

geogw-phiaal looatlon* ^ Sa^ 
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prbblam of Ineisor growth for aating, 
may laad to. death, . ^ 

3jief£iStiveneis of ehemical repellents beaausa thay iS 
dapand on taste. 
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17)30 loeatlon iJialda mouth means ean ehaw suiythiagi 
sharpnass of biting surfaca. 
(2^31 .. hardnass of iurfaeai large biting force results In 
anormous presmire, = ' \ , ; ^ ' y 
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teumeratlon of t^as a^d thictoaisai ©f eitla armor 
in, isiz^lM, 



diamatar of eabl^a «d wlras in aafflpla, ^ 
h% ganaiml deso^iptiai of sa^laa of oablasjWlrer 

and protaatlva mterial inoluding roUan^ rapelliinlis 
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armor and shiald t^es uiad with -^imm* 
(4^46 . dasGription burled wira mid their shiald and armor. 45)46 



Q3)33#34 dasipi-of oafasiwho^ mat arial^ details of dail®i for 
V aseparimants 

35 how tait matarlals^ mount ad iji ^aga. 
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^,37 bow lOTg jan^lai laft In .aagajno nead foi^i£centl,ve 36,37 
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38 htmgar not a faotor in ehewlng, / . .0 38 
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58,59 maj^ffluffi Jaw ©pealag and how liU'ge cables can be 58,59 
/ be attacked, . . , 

61 field ifepQrts lupport mlriifmaffl cable dlamater for 6l#6i 

no daiaasep ' . ' 

6^,63 field T^fivt of ai£a and loaaticn of lai'gtst 63 
cable dansagti 



6^,65 cehitructlon cf cable sheath unlir^artejiti 



66 



dlicusslon cf steal armored cable,, 
S6 . result s no need to iolder steel armored cable * 



^51,52 mm plastic offered prctectlOTi plastlo to prevent ^51,5^ 
Gprrpelsn did not work,' tl^ctoeases of lead 
and alUffilniMi that did not work* ' 



50 other plastics, even thoae with reptile failed, 50 



5^#55 '\i description of most of other metals ttat were eff active 5^#55. 

statement that mo it metals, were effective, v p3S ' 



^^68 all armored cable/ of fered pTOtectlon but corrosion ®5#^^ 
^ of armor if penetration of ^ plaatic cover because 

. afror then leas resistant, ^ ^ ^ 

/ ■ . . - - ^ 

6g corroslOT wkmi normal thictaies^ of arror not always Sg ^69 



78,79 i^ortance of typa of toterrial itraeture of wirei If, TO 
air space la eipeciaia^r had, 

80 one id.re/not damged had no air ipaee in core. _8Q 
77 degree/of overlap of windings had ho effect* 77 
n^m ' in general, wires did not perform well. 

75,^ odd^result for two wires |daseription of difference 75,76. 
inwlndtog, 

81 conclusion about wire consiruetion based nn results 81 

having to do with lolld core. 

■ 2,4 ■ . ■ ' ■ . ' 
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©Id theo^ about toratlon of need for a&bX# 

prot©ctl©n,v U 
gr^^md oo^skoted, but damage reporbid. after 6 ^ears * 71 
protaotlaa nj^tdsd - for life of cable j not Just 5 years v 



best armom f or buried ifire is itajileia :iteiil.:ur^^ 
dlsadvi^tage of stainless steel is - tfcit i% ktt^ot 8ft ; 
lightning I lia^tad use to ierrtce: Inattad^^^^o 



reason for sarviae wire resistwoerto liglifea^ 

opposed to rural wire i reaso^ndation for type 
"of rural wire* 

wh^ this type is best for rtsist'jjig lightnteg = 
, damage* > ^ * 

Denver stud^ only the j first phasfe of s"tadiei*: 
reason for field .teat* 

reasona fos^ ohoiee of field test area aiid the 
nature ^f field test* ' - 



85*86" 
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90j93 . place of testiuse af test results. 
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91,92 
90,93 



gipilf leant Ob serrations Bade to date, 
plastl^es vs. metals, 

prospects, i •, 
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Verto&i description' Of .the content Of th^^^ . m;.: 
sentences of the clusters for the ciusterlrig; ; 
anaiysla of WOT. ' Sentences in the: same; 'dl^ 

are "bracketed together with straight >iihes^:! ^^^ 
A diamond ^ encloses -the number of each 
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sentence that Is the first sentence of a 
paragraph. , 

hlstarieal IntMduatlon - aoin telephona itrvlce* 
aMOBqpilihmtnta, — = . . 

' rtvlew. '©f a isrvlQa ^ . ^ 

IsproveMiit pragma dafihed* 

pM'pay (dtpQeit) aallijig linae ISteO's, 

e^ECSptlQtt tQ pre-pay galling ^ 

■ deicrlptlon ©tf p©at«pay operation Mid Its 
dliad^vmntagsi^ ^ 

dlal-t@na-flrat a maj©r faatttre in^ro^eiflenti 
ftaturei dsftoed. ' 

without dial toni don*t toow If phsaa li worMiig 

before dapQsit ef Drohey* 
risk of iosi-'of-D^ney avoided with dlal^tone-flrst, 

type of oalls with dlal^tona-flpit made without 
deposit * . 

oan dial without depositing for certain asllii 

tooludlng ealli requiring opei^tor aselitanee. 
proaedhire for regular laeal call* . = 

details of proeedure for oorreGtlng deposit errori* 
. l^roveaent ^rogms offers batter volge trmimission 
estimation of how and wl^. 
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modlf ioations not the on^ problem* 

holding timm Inareasea due ta time to deposit aoln kO 

after dial tone pro7lded|a].50 need to redlal. ^ 
aoin calls handled hy BBS sli^larS^r take more tlms« 

£ me additional equipment needed In eentral office - 37,38 

e^su^le of holding ttm of oroisbaF, 
more landers and regiateriimore coin^contrdl qlrculta ^1>43 
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heoauae of holding tl^ ESSi 



transfer of panel affiee servioe to ISi or aroiabar. 30 
aq^pmant daslpii available to nbdif^ crosibar, S9 
effiges I step-by .-step off laea 3n 1970» 



31 



four key coii^onents of erossbar modlf iQatloni 

line eirTOltSj . - 

senders J markers* 

circuits jESS conversion via profriim* 

changes in cord switchboards* 
conversion of pbst-pay offices* 

comprd^ses to minlod.ie costa at coin telephones 

- ■ , ,;"^^-7-''"'' / - so 

main modlf leat Ion to nultfilot for dial-tone -first | k9 
none for new universal single - slot, . 

modifications of multl- and iingle-slot desoribedi, 5^52 
same J different , ■ V 
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Ideally appl icable to all caiitral ©fficeEj 
EWltahboardE and stations « 
26^27 but changes overfall substantlaljtharafore there. 26,27 
ara eospromlses* 

' ohmgas in sl^alllng batween teiaphonas wid .central 
©ffiea rifecesiary. 

5^^5S . hew signalling ©peratas GUrrantly, ^^^^^ 
56#57 how gi©ialllng will operate with dial-t©ne-f Irit , 56^57 

^^59 description of eon^lieation to cantral of floe ^^59 
because must verify Initial deposit if not 
required, 

fiOjSl central offices differ Jji whan and ray to deelde 60 
to co^leta or deny caliibut same check for cotos. 

" , ' 61 

62 ■universal colji telephone posalble with central 62 

office. . X ^ " ■ 

^^64 addition of 448-volt battery and use In- central office, ^^64 

when batteiy conneeted with cord awitoh, ^^66 

66 this precludes Touch^Tone, , * 

67#68 "lii future, fewer cord ewitehboards, thus limit on ' . 67#68 
toich-fone will lessen. 
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trials Qf dial-tone -first, 

first trial - where - Eartford, = ^ 

renditions of M.rtford trial* 

puipose of tortf ard trial, 

;itandard davalopad as rasult of Hartford trial 
triad In Danville, 



4 

70 



^#72 

73' 



85,86 
80 



5,79 



81 



88 



75.76 



purpose of Danvllie triaL 
□onditione of Daavlllt trial, 
method of ©peimtlni In Grttnwlgh trial, 
Oreenwieh YlUage trlaa* 

oondltions of qraenwleh trial. - ^ 

reaotlon to Greenwloh trtol, ^ 

Di^irtJ.le trial verified feasibility of overall 

pl^ and stsjidard method, 
traniitlon to dlal-tone as smoothly 5ji iwltahboard 

as in MP, 

feasibility ©f servloe eonflrMd in Itertford trial 

and good oustamer reaction, 
speolflci of results * posltiTe «id negative. 
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eost of conversion and schedule 
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saw the two passages, J . 
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The mean toploaj. si^larlty between eenteaees 
as a funotlon of spatial proxlirity during the 
topic generation and identification phases of 
the mapping procedure. 
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Table 5 , 
Mean probability measure of similarity 
between adjacent sentences at paragraph 
boxmiaries and adjacent sentences within 
paragraphs as a function of the presence 
of paragraph Indentation 



Version of Text 
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Topic Identification 


Adjacent Sentences 
At Paragraph 
Boundaries 


Adjacent Sentences 
, Within 
Paragraphs 


^dented 


.60 


.93 


Unindented 


,68 


.84 


NOD t 

^dented 


.59 


.93 , ' 


Unindented 


,67 


.88 



33 



Figure Captions 

Fig* 1 The topical organization of GOP resulting from an 
hierarchical clustering analysis of the similarity 
matrix based on the probability measure of similarity* 

Fig* 2 The topid^l organization of GOP re suit tog from 

hierarchical clustering analysis of the similarity 
mtrix based on the frequency measure of similarity , 

Fig* 3 The topical organization of NOD resulting from an ^ 
hierarchical clustering analysis of the similarity 
matrix based on the probability measure of slmllarltyV. 

Fig. 4 The topical organization of IfOD resulting from an 
hierarchical clustering analysis of the similarity 
matrix based on the frequency measure of similarity* 

Fig, 5 Graphical evaluation of the goodness -of -fit for the 

, GOP tree structure (Fig* 1) derived from the probability 
measure of similarity. 

Pig* 6 Graphical evaluation of the goodness-of-flt for the 

GOP tree structure. (Fig. 2) derived 'from the frequency 
measure of similarity. ' - , 

Fig. 7 Graphical evaluation of the gootoess«of-flt for the ; 

NOD tree structure (Fig. 3) derived from the ^ probability 
measure of similarity. 

Fig* 8 Graphical evaluation of the goodness-of-fit for the 

NOD tree structure (Pig* 4) derived from the frequency 
measure of similarity. 
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